EIGENVECTORS OF TENSORS AND ALGORITHMS FOR 
WARING DECOMPOSITION 



LUKE OEDING, GIORGIO OTTAVIANI 

Abstract. A Waring decomposition of a (homogeneous) polynomial / is a 
minimal sum of powers of linear forms expressing /. Under certain conditions, 
such a decomposition is unique. We discuss some algorithms to compute the 
Waring decomposition, which are linked to the equations of certain secant 
varieties and to eigenvectors of tensors. In particular we explicitly decom- 
pose a cubic polynomial in three variables as the sum of five cubes (Sylvester 
Pentahedral Theorem). 



1. Introduction 

In this article we shall be concerned with the following general problem: given 
a polynomial, what is its minimal decomposition as a sum of powers of linear 
forms? 

Let V be a complex vector space of dimension n + 1 and let S d V denote 
the space of d th -order symmetric tensors. A choice of basis {xq, . . . ,x n } for V 
induces a natural choice of basis for S d V and allows one to express / G S d V as a 
polynomial in the variables Xj. Because of our interest in tensor decomposition, 
we often do not make a distinction between a symmetric tensor and a polynomial, 
and often use the terms interchangeably. 

Let / G S d V. In this paper, our focus is on the symmetric tensor decomposi- 
tion 

r 

(1) / = 

i=l 

where ^67 have degree 1 and q G C. For historical reasons, we will refer to 
this as a Waring decomposition or simply a decomposition of /. The minimum 
number of summands occurring in such a decomposition is called the (symmetric) 
rank of /. 

Remark 1.1. Note that when working over C, the constants c, in ([1]) may be 
assumed to be equal to 1, however this ambiguity will become important in our 
algorithms as initially we will only find the classes of the linear forms [vi\ G FV 
(the projective space of lines in V) and we will have to solve an easy linear 
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system to find appropriate q to resolve this indeterminacy in order to complete 
our decomposition algorithms. 

Because of the vast amount of work done in this area by several authors in 
diverse areas of science, the Waring decomposition is also known by many other 
names such as canonical decomposition (CANDECOMP/PARAFAC (CP)), rank- 
1 decomposition, sum of powers decomposition, and so on. 

We start with an example that illustrates the main results of this paper, which 
are Algorithm H] and Thm. 14.71 A classical result, attributed to Hilbert, Rich- 
mond and Palatini, states that a general form / £ S 5 C 3 can be decomposed in 
unique way as the sum of seven fifth powers. For reference, see |RS00j or |CSllbj 
and the references therein. 

Set for the moment V = C 3 . In the formula ([1]) with r = 7, d = 5, we want 
to find an efficient algorithm to find the u$ £ C 3 and q £ C which give the 
decomposition of /. To / we associate a linear map 

P f : Hom(SV, V) ->■ Hom(V, S 2 V), 

where Hom(S 2 V, V) is the space of linear maps from S 2 V to V. Note that the 
target space is the dual of the source one. If / = v 5 then the definition of P v s is 
the following 

(2) P v5 (M)(w) = (M(v 2 ) AvAw) (v 2 ), 

where M £ B.om(S 2 V, V) and w £ V. In general, if / is any element of S 5 V, the 
definition of Pf is extended by linearity (which can always be done because S d V 
has a basis consisting of powers of linear forms) . 

To see the map Pf explicitly as a matrix, consider the monomial basis {%iXj}o<i<j<2 
of S 2 V. It turns out that with an appropriate ordering of the basis, Pf is rep- 
resented by the following 18 x 18 block matrix, where the nine depicted blocks 
have size 6x6 



(3) 



C h -C h 

' C h C fo 



and Cf i is the matrix whose entries labeled by ((j, k), (p, q)) (for < j < k < 2, 
< p < q < 2) are given by the fifth derivative fikjpq- 

If / has rank 1, we may choose coordinates such that f = Xq. In this case 
rankC/ = 5qi and the matrix P x s has rank 2. By the linearity of Pf in the 
argument / and the sub-additivity of matrix rank, if / has rank r, Pf must have 
rank < 2r. In fact, we will see that in addition to giving this bound on rank, Pf 
will also be the key to an efficient way to decompose /. 

Now we have the following result (see Lemma [3.31) : P v s(M) = if and only 
if there exists A such that M(v 2 ) = Xv. Such v is called an eigenvector of the 
tensor M (or simply eigenvector of M, when the context is clear), where the 
homomorphism M: S 2 V — > V is viewed as a tensor in S 2 V* ® V. 



1 see Remark |2. II for a discussion of the term general 



EIGENVECTORS OF TENSORS AND ALGORITHMS FOR WARING DECOMPOSITION 3 



Eigenvectors of tensors were introduced and studied in [Lim05l Qi05| . Re 



cently, Cartwright and Sturmfels |CSlla] have found a formula computing the 
number of eigenvectors of a tensor M G Hom(S' m C n , C n ). (In Section[6]we review 
their formula and propose a geometric interpretation of this formula that leads to 
an alternative proof which generalizes to other cases.) The Cartwright-Sturmfels 
formula says that there are exactly seven eigenvectors {v±, . . . ,Vj} of a general 
tensor in Hom(S' 2 C 3 , C 3 ). Later we will see that general elements of the kernel 
ker Pf share the same eigenvectors. These seven eigenvectors appear exactly in 
the decomposition (Qp. This is the basic novelty of this paper with respect to 
|L011j . where the same example was considered. Our methods also provide a 
solution to the pentahedral example (see Section I3.2[) . which was left open in 
[LOTT] . 

In our case, choose a basis {2/0? 2/1? 2/2} of V and its dual basis {x ,Xi,x 2 } of 
V*. Any M G S 2 V* (g) V can be written as Y^i=oyiQi{ x ) wnere Qi is a quadratic 
polynomial. Recall that v is an eig envector of M G S 2 V* ® V if M(v 2 ) At) = 0, 
so the coordinates of the seven eigenvectors can be found by the vanishing of the 
2x2 minors of the matrix 



(4) 



X X\ X 2 

Qo qi <?2 

Now we summarize the steps to compute the decomposition of general plane 
quintics, / G S 5 C 3 : 

Algorithm 1. 
Input: / G S 5 C 3 . 

(1) Construct the matrix P f : Hom(S 2 C 3 , C 3 ) -> Hom(C 3 , ,S 2 C 3 ) as in ©. 

(2) Compute ker Pf. Choose a general M G ker Pf and write M = Yli=o Viqi( x ) 
as above. 

(3) Find eigenvectors {vi, . . . , v-?} G C 3 of M via the zero-set of the 2x2 
minors of ([!]). 

(4) Solve the linear system / = Yli=i c i v t i n ^he unknowns q G C. 
Output: The unique Waring decomposition of /. 

In this article, our approach to Waring decomposition uses algebraic geometry, 
starting with the classical Sylvester algorithm and the notion of eigenvectors of 
tensors. With the aid of recent progress |LQ11] , on equations of secant varieties 
using vector bundle techniques, we are able to go further. In fact, this paper 
can be seen as a constructive version for the symmetric case, of the techniques 
developed in |L011j . The main result of this paper is a new algorithm for efficient 
Waring decomposition of symmetric tensors, stated in general in Algorithm HJ 
This algorithm is a consequence of the geometric facts contained inProp. I4.1l and 
Thm.gTl 

Of course our algorithm will not always succeed to produce the Waring decom- 
position of symmetric tensors unless the rank is sufficiently small and the tensor 
is general. On the other hand, we can give precise bounds for the maximum rank 
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of a symmetric tensor which can be decomposed via our algorithm. In Thm. I2.4[ 
we give an improvement to the Iarrobino-Kanev bound on the applicability of 
the catalecticant method for Waring decomposition. Going further, we state in 
Theorems 13.51 and 15.41 sufficient conditions for the success of our algorithm in 
the cases of symmetric tensors on 3 or more variables, respectively. We give a 
further discussion of what happens in the case that our algorithm fails in Re- 
marks 14.61 14.81 In particular there is at least one case where our algorithm fails 
to decompose a tensor, but, as a side effect, brings to light a new way to express 
a rational quartic curve through 7 (given) general points, see Remark 15.51 

In addition to finding a new algorithm for Waring decomposition, and bounds 
for its success, we find a new proof of a result of Cartwright-Sturmfels (Proposi- 
tion 16. ip on the number of generalized eigenvectors that uses Chern classes and 
generalizes to other types of generalized eigenvectors (seeProp. 16. 2p . that we call 
simply eigenvectors of tensors. 

The use of tensors is widespread throughout science and appears in areas such 
as Algebraic Statistics, Chemistry, Computer Science, Electrical Engineering, 
Neuroscience, Physics and Psychometrics. A common theme at the 2010 confer- 
ence on Tensor Decomposition and Applications in Monopoli, Italy was the need 
for efficient and reliable algorithms to perform tensor decomposition. Indeed, we 
are certainly not the first to consider this problem. For a sample of related recent 
progress on tensor decomposition, see [BCMT101 ICM961 IBB111 IBGI111 IBBlOj . 
Our aim is to use algebraic geometry as a basis for algorithms that can be used 
(either in place of or in combination with the previous algorithms) to improve 
efficiency and robustness. We make some comparison with our methods and 
those of [BCMT10] in Remark I3TTU1 

Kolda and Mayo |KM10] have recently studied an efficient way of computing 
eigenvectors of tensors, analogous to the usual iterative procedure to compute 
usual eigenvectors of matrices. Further work has been done by Ballard, Kolda and 
Plantenga |BKP11] to implement this method and they have achieved significant 
speed-ups utilizing a GPU when eigenvectors of many small tensors are to be 
computed. Since our tensor decomposition methods use eigenvectors of tensors, 
this indicates that these methods could be combined with our algorithms to 
improve efficiency. 

Another aspect of using tensor eigenvectors in our algorithms is that it may 
be reasonable to try (in the tensor setting) to mimic a method to approximate 
a matrix by one of lower rank via eliminating the eigenspaces corresponding to 
small eigenvalues. We hope that this article can serve as a starting point for such 
a study. For more issues regarding low-rank approximation of tensors, and the 
well-posedness of this problem, see |dSL08j . 

We have structured this article for two diverse audiences; algebraic geometers 
and researchers from a variety of applied fields studying tensors. With algebraic 
geometers in mind, our goal is to show how well-known techniques in algebraic ge- 
ometry can be used to solve problems in applied areas. Most of our constructions 
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are better explained with the geometric language of vector bundles. However, we 
did our best to first state the main results by using explicit matrices, without 
the language of vector bundles. For this reason, we defer the main proofs un- 
til Section HJ in the sense that almost all our results are particular cases of a 
general result (Thm. I4.7|) . which is stated with the language of vector bundles. 
Keeping in mind researchers studying applied tensor problems, in Section [5] we 
describe the history and state of the art of algebraic geometry concerning tensor 
decomposition from a practical point of view. Because in their original versions 
the statements may not have been so accessible for applications, we have restated 
results from algebraic geometry, hopefully in a more transparent language, con- 
cerning generic rank (see Thm. 12. 2p and uniqueness of tensor decomposition (see 
Thm. [2~3|) . In Section [3j we go on to illustrate our new techniques that use Koszul 
matrices to compute Waring decomposition. As mentioned above, in Section H] 
we give the generalization of our techniques. Finally, in Section [7] we describe our 
Macaulay2 implementation of our algorithms, and we hope that this will serve 
as a starting point for further implementations of these methods. 

2. Classical methods for tensor decomposition: Sylvester's 

catalecticant method 

2.1. General results on the symmetric rank. 

Remark 2.1. We use the term "general element" in the sense of algebraic geome- 
try to indicate that the element is chosen to avoid a (Zariski) closed set. So when 
we say that the general element in a variety X has a property, this means that X 
contains a dense subset X° such that every element in X° satisfies that property. 
We call a property "generic" if it holds for general elements. We consider the 
"general" assumption a mild assumption because in practice, most tensors we 
encounter in nature will actually be general. It is no loss to replace "generic" 
with "almost always" and "general" with "randomly chosen," or "up to certain 
non- degeneracy conditions." 

Regarding ranks of tensors, the following capstone theorem answers the ques- 
tion completely for general symmetric tensors. 

Theorem 2.2 ( |AH95j ). Let V be a complex vector space of dimension n + 1. 
The general / G S d V has rank 

n + l ' 

which is called the generic rank, with the only exceptions 

• d = 2, where the generic rank is n+l. 

• 2 < n < 4, d = A, where the generic rank is ( n ^ 2 ) • 

• (n, d) = (4,3), where the generic rank is 8. 
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Note that we use the functions [•] and |_ - J respectively to indicate round-up 
and round-down. The elements of rank one in S d V are just the polynomials 
that are d-th powers of a linear polynomial. They form an irreducible algebraic 
variety, which is the cone over a projective variety which is called the d- Veronese 
variety of P n and we denote ^(P™). For all values of k less than the generic rank, 
the (Zariski) closure of the elements of rank < k is a irreducible variety, which is 
the cone over <T/ c (wrf(P n )), the latter is called the k-th secant variety of f^(P n ). A 

consequence of the Alexander-Hirschowitz theorem is that if k < > then 

dim<7fc(fd(P n )) = k(n + 1) — 1 (the expected dimension) with the only exceptions 

• d = 2, 2<k<n 

• 2<n<4,d = A,k = ( n + 2 ) - 1 

• ( n ,d) = (4,3), k — 7. 

The cases listed above are called the defective cases. 

After the rank of a tensor is known, the next natural question is whether there 
is a unique decomposition (ignoring trivialities). The following represents the 
state of the art regarding this question. 

Theorem 2.3 ( |UU02| . |Mel06j . |BaL05j ). For all values of r smaller than the 
generic rank, the general element of rank r in S d V has a unique (up to scaling) 
decomposition f = Y^i=i c i( v i) with the only exceptions 

(1) the defective cases, where there are infinitely many decompositions 

(2) rank 9 in S 6 C 3 , where there are exactly two decompositions 

(3) rank 8 in S^C 4 , where there are exactly two decompositions. 

The cases listed as (2) and (3) in the above theorem are called the weakly 
defective cases. For the generic rank, it is known that when n + 1 does not divide 
("d'O then there are infinitely many decompositions. On the other hand, when 
n + 1 divides ( n ~j i d ) , apart from the defective cases, then there are finitely many 
decompositions. In the latter case, it is expected that the decomposition of a 
general element in S V is rarely unique. In this situation, the only cases where 
uniqueness is known to hold are the following: 

• S d C 2 , for odd d, which was addressed by Sylvester in 1851. 

• 5* 5 C 3 , which is the case addressed in the introduction. 

• 5* 3 C 4 , the uniqueness result for the general rank, which is 5, is known as 
the Sylvester Pentahedral Theorem. 

It is expected that these three are the only cases where uniqueness of decompo- 
sitions hold for the general element. Partial results confirming this expectation 
are proved in |Mel09j . 

A consequence of our construction is that we can treat all three of these cases in 
a unified manner. The fact that our construction only finds these three cases gives 
further evidence that these may be the only exceptional cases for uniqueness. 

Despite this beautiful theoretical picture, it is hard to compute the rank of 
a given symmetric tensor and to find explicitly its tensor decomposition. The 
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brute force attempt to solve ([I]) in the unknowns defining each v $ with a computer 
algebra system is time and memory consuming and often fails, even in small 
dimension. In fact, it is known that most tensor problems are extremely hard 
and often unsolvable [HL09j . So while our goal is to make improvements in 
efficiency and reliability of tensor decomposition algorithms, we know that the 
generic problem for large tensors will remain difficult. 

2.2. The catalecticant method. The catalecticant method was developed in 
the XIX century, by Sylvester and others, to compute the rank of a symmetric 
tensor in S d C n+1 and to compute its Waring decomposition. The method is 
completely successful in the case of binary forms, i.e. n = 1, but gains only 
partial success for n > 2. It is important to understand it deeply, because most 
of successive methods proposed, including our method developed in this paper, 
can be considered as a generalization of the catalecticant method. 

Let {xj}i<j<„ + i be a basis of a vector space V. Given / £ S d V one can define 
maps for each m < d 

Cf: S m V* — >S d ~ rn V 

( 5 ) T . ... T . ^ &21 

Seen as a matrix, C™ is known as a catalecticant or (semi-)Hankel matrix. From 
(the more recent) point of view of tensors, C™ can be thought of as a symmetric 
flattening of a symmetric tensor, which is a symmetric version of a flattening of 
a tensor. For this and other types of flattenings see |LQ11] . 

Note that if / = l d then rankC™ = 1, hence if / has rank r then rankC™ < 
r. This gives lower bounds on the rank of /. But in fact, in this case, when 
CJ 1 is non-trivial we will be able to further exploit this construction to aid in 
decomposing /. 

In their book |IK99j . Iarrobino and Kanev described the key ingredients to the 
classical catalecticant method. In particular they describe an algorithm to find 
a Waring decomposition. We have implemented this approach, see Section 17.11 
Here is a summary of the algorithm. 

Algorithm 2 (Catalecticant Algorithm). |IK99t 5.4] 
Input: / £ S V, where dim V — n + 1. 

(1) Construct, via (0), the most square possible catalecticant CJ 1 = Cf with 

(2) Compute kerC/. Note that rank(J) > rank(C/). 

(3) Find the zero-set Z' of the polynomials in kerC/. 

(a) If Z' is not given by finitely many reduced points, stop; this method 
fails. 

(b) Else continue with Z' = {[vi], . . . , [v s ]}. 

(4) Solve the linear system defined by / = ^* =1 Civf in the unknowns q. 

Output: The unique Waring decomposition of /. 



s 
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Algorithm [2] is a special case of Algorithm [4] below, where we take E = 0(m) 
and L = 0(d). The following theorem gives a sufficient condition that guarantee 
its success. It can be seen as a slight improvement (at least for n > 3) of the 
bound described in |IK99] . see Theorems 4.10A and 4.10B. 

Theorem 2.4. Suppose f = J2l=i v f ^ s a general form of rank r in S d V , let 
Zt = [vi] G P(V) be the corresponding points and let Z = {z±, . . . ,z r }. Set 
m=\f\. 

(1) If d is even and r < ( n ^ m ) — n — 1 or if d is odd and r < ( n+ ™ _1 ) , then 

(6) keiC f = I z , m , 

where Iz,m C S m V* denotes the subspace of polynomials of degree m 
vanishing on Z C V . Moreover Algorithm^ produces the unique Waring 
decomposition of f . 

(2) Finally if d is even, and r = ( n ^ m ) — n, then it is possible that Z C Z' , 
where Z' is obtained by Algorithm^ But still when n = 2, the algorithm 
will produce the unique minimal Waring decomposition. Further when 
n > 3, the algorithm will succeed after repeating step (4) finitely many 
times using subsets Z" C Z' of size rank(Cj). 

As mentioned in the introduction, many of our statements in this section are 
consequences of more general results that are proved later in Section HI While 
the reader may better understand the following arguments after the methods 
in Section H] are presented, we believe that it is worthwhile to anticipate their 
use now so that the simple pattern in this case may be seen. 

Proof. First notice that by Thm. 12. 3[ we know that in this case we have a unique 
decomposition. 

Here we use the more general theory applied to this specific case. Now we 
prove (1). ByProp. 14.11 with E = 0(m) and L = 0(d), we have the inclusion 
Iz,m Q kerC/, and by the same Proposition, the equality holds if we can show 
that the map H°(E* ®L) — >H°(E®L\z) is surjective. For our particular choice 
of E and L, we are considering the map H (O(—m + d)) — >H (O(—m + d)\z)- 
Let m' = d — m = [|J . So this amounts to showing that the natural evaluation 
map 

S m 'v* — >c 

f^(f( Vl ),...J( Vr )) 

is surjective. But this is equivalent to the condition that codim(J^ im /) = r. The 
preceding condition is satisfied because (by a basic dimension counting argument) 
r general points give independent conditions on hypersurfaces of degree m' when 
r is bounded by ( n ^T)- It follows that rankCj = r and we can apply Prop. 14.31 
and conclude that Iz, m ' = kerC/. 

Next we will use the following theorem. 
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Theorem 2.5. [CC021 Theorem 2.6] , Assume thatr < ( n + m )-n-l. Let X be an 
irreducible projective variety. For r > 3, if every (r — 2) -plane spanned by general 
points xx, . . . , x T -i also meets X in an r-th point x r different from x\, . . . , x r -\, 
then X is contained in a linear subspace L, with codirrii(X) < r — 2. 

Let Z' = baseloc(ker Cf). We claim that Z' = Z, and the equality Iz, m — 
kerC/ implies that the points Z' can be used to give the decomposition of /. 
Indeed, we just showed that Z' = baseloc(Jz,m)j and the latter is equal to Z by 
applying Theorem 2.6 in |CC02] . stated above. 

In the odd case, note that we will have Cf. S m V — >S m ~ 1 V, and we always 
have ( n+ ™ _1 ) < ( n t m ) - n - 1. 

For the proof of (2), now Z' will be given by m n points, complete intersection 
of n hyper surf aces of degree m. Note that for n > 2 we have ( n + m ) 
For n = 2, the m 2 points impose independent conditions on the hypersurfaces 
(curves) of degree 2m. Indeed, from the sequence 

— >0 — vO(mf — vZ z ,(2m) — >0, 

one shows that h l {Tz'(2m)) = 0. Denote Z' = {v\, . . . , v m 2}. The last vanishing 
implies that the powers (vi) 2m for % — 1, . . . ,m 2 are linearly independent. Then 
the linear system in step (4) in the algorithm has a unique solution , which 
moreover only uses rank(C/) of the points. 

For n > 3, since the vf are no longer independent, the linear system in step (4) 
of the algorithm will no longer have finitely many solutions using all the forms. 
However we can overcome this problem by repeating step (4) with each subset 
of Z' of size rank(C/) until we find the decomposition. 

For the rest of the theorem, apply Algorithm |2] and refer to Thm. 14.71 □ 

2.3. Limits of the catalecticant method. As in the proof of Thm. I2.4[ let 

m' = [|J . Another way to see that the catalecticant method can work for 

polynomials of rank r only if r < ("^T ) , is the following. The maximum rank 
of any catalecticant matrix Cf: S m V* — >S m> V is If / has this rank or 

greater, we will either (in the even case) have no kernel to work with, or (in the 
odd case) fail to satisfy the equality Iz, m ' = kerC™. 

/n+d\ / 

Usually the general rank n d + l is larger than ( n ^T ) , the first example being 
^3(P 2 ), where the catalecticant matrices have size 3 x 6 or 6 x 3. The equation of 
a^(v^(¥ 2 )) , which is called the Aronhold invariant, cannot be found as a minor of 
any catalecticant matrix. It can be obtained as a Pfaffian of a Koszul flattening 
(see |Ott09j ). In practice, one should start by applying the catalecticant method, 
and if it fails to produce the Waring decomposition, this implies that the rank 
is larger than the bounds listed in Thm. 12.41 Next we introduce new algorithms 
that will succeed to decompose tensors in a larger range of ranks. 
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3. New methods for tensor decomposition: Koszul flattening and 

eigenvectors of tensors 

Note that the following definition also appeared in |Ott09] in a few special cases 
and was further developed in |L011j . both with a focus of finding equations of 
secant varieties. We also note that this construction was used in the case of 
partially symmetric tensors to find the ideal-theoretic defining equations of the 
fc-th secant variety (with k < 5) of Seg(F 2 xP n ) embedded by 0(1, 2) in jCEOll] . 
Here our presentation is focused on using this construction to find decompositions 
of tensors via eigenvectors. 

The general setting of this section is the following. Let / G S d V and fix 
< a < n, 1 < m < d — 1. We construct a linear map 

(7) P f : Eom{S rn V, ffV) Rom(ff~ a V, S^^V). 
If / = v d then the definition of P v d is the following 

(8) P v a{M){w) = (M(v m ) Av Aw) (t^-™- 1 ), 

where M G Hom(S' m V, f^V), w G f\~ a V and we fixed an isomorphism f\ +l V ~ 
C. If / any element of S d V, the definition of Pf is extended by linearity. This 
extension is guaranteed by the fact that on general decomposable tensors / = 
v\ ® . . . ® Vd the map Pf has the following expression 

P Vl ®...® Vd {M)(w) = ^ ® • • • ® V a (m)) A V a{m+1 ) A w) CT ( m+ 2)<S>- • -®V a{d) ), 

which is visibly linear, where the summation is performed for a in the symmetric 
group of permutations on d elements. 

Although this definition might seem artificial at first glance, we now explain 
how it can be used. We wait until Section 14.21 for a more formal treatment of 
Pf via a presentation of a vector bundle. The linear map Pf can be explicitly 
computed by using Koszul matrices, which motivates the name Koszul flattening 
that we give to Pf and is intended to mirror the term symmetric flattening which 
it generalizes. 

In order to explicitly write down the matrix representing Pf, we need to recall 
the properties of the Koszul complex. It is the minimal resolution of the field 
C as an R = C[xq, . . . , x n ]-module. Here we give some examples, but interested 
readers unfamiliar with the Koszul complex may wish to consult [Eis05| . For 
n = 2 the Koszul complex is 

— >R{-3)^R{-2) 3 ^R{-1) 3 ^R — — >0, 

where 

fci = (x Xi x 2 ) 

(0 x 2 —xi 
-x 2 x 
x\ x 
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For n 



where 



3 it is 




R{-A) 



k,4 



kR(-3) 4 


-^R(-2) 






h 


= (%0 X\ 


x 2 x 3 ) 




f—Xi - 


-x 2 


-x 3 


\ 


x 


-x 2 


-x 3 








Xq X\ 





-x 3 


V 





Xq X\ 


X2 J 



►C— ►(), 



In general we have 



h: R(-i)\ i > — >R(-i + 1 



i-l) 



The matrix hi is a Koszul matrix. It corresponds to the presentation of the vector 
bundle f^\ +1 ~ l Q{—i) and on the point (v) G PV^ it corresponds to the wedge 



product /\ n+i l V J ^-/ l \ +z l V. We note that the Koszul complexes can be easily 
computed by any standard computational algebra system such as Macaulay2 
[GSIOj . _ 

Next we recall a result from |L011j that gives an explicit version of this con- 
struction. 

Lemma 3.1. Let f G S d V . The matrix P f : Eom(S m V, ffV) -> Hom(/\ n ~V, S d ~ m ~ l V) 
can be computed using the matrix k n+ i_ a of the Koszul complex, of size ( n+ ) x 
(a+i) > w h ere a t ihe place of the indeterminate Xi we substitute the catalecticant 



matrix of size ( 



n+d—m—l 



) x { n+ n m ), where f, 



df_ 



The matrix Pf obtained 



has size 



n + m 
n 



n + 1 
a 



x 



n 



d — m 
n 



Proof. See jLOTTl Section 8.3]. 
Now we propose the following 



□ 



Definition 3.2. Given M G Hom(S m V, f*\V), a vector v G V is called an eigen- 
vector of the tensor M if 

M(v m ) Av = 0. 

For m = a = 1 this is a usual eigenvector, and for a = 1, any m this agrees 
with the notion of |Lim05t Qi05| . 

Now we have the following important lemma (whose proof is straightforward) 
that we would like to emphasize. 

Lemma 3.3. Let M G Eom(S m V, 

(1) A vector v EV is an eigenvector of M if and only if M G ker(P„d). 

(2) Let f = Yl v f- If each Vi is an eigenvector of M , then M G ker Pf. 
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Cartwright and Sturmfels have recently found a formula computing the number 
of eigenvectors of a general tensor M G Hom(S' m V, V), which is the case a = 1 
in our construction. The following theorem generalizes the Cartwright-Sturmfels 
formula to any a. 

Theorem 3.4. For a general M G Hom(S m V, f\V), the number of [v] G FV 
such that M{v m ) Av = 0is given by 

m for a = 0, 2 and n — 1, 
oo u>/ien a = 0, n + 1 and n > 1. 

• /or a = 1 

m— 1 J 

• 0/or2<a<ra-2 

(m+l) n+1 +(-l) n j. 

• 1 m+2 fora = n-l. 

Proof. The non-trivial cases follow from Propositions 16.11 and 16.21 □ 

For a = the condition becomes M(v™~) = and Pf reduces to the catalecti- 
cant. The cases a = 1 and a = n — 1 are a bit special. Note that Hilbert's quintic 
example from the introduction fits the case a = 1 (and agrees with the case 
a = n — 1 since n — 2), while we will see that Sylvester's pentahedral example 
fits in the case a — n — 1. 

As we mentioned in the introduction, in the case a — 1, the iterative methods 
of |KM10] may be used to find eigenvectors of tensors in Hom(5' m V, V), however 
for general a, more work needs to be done in order to efficiently find eigenvectors 
of tensors in B.om(S m V, ffiV). The idea, like in the matrix case, is to iterate the 
map 

Vk+1 ~ \\M{ V f)\y 

and to successively approximate the eigenvectors, starting from the dominant 
one and repeating until all eigenvectors are found. 

Next we describe a general algorithm to decompose polynomials, which cor- 
responds to the case a — 1, any m. This algorithm is especially effective in the 
case n = 2. 

Algorithm 3 (Koszul Flattening Algorithm 1). 
Input: / G S d V, where dim^/ = n + 1. 

(1) Construct, via © and ©, Pf. Rom(S m V, V) -> Hom(A n "V, S^^V). 

(2) Compute kerP/ and note that rank(/) > rank ( p /) m 

(3) Find Z' , the common (projective) eigenvectors of a basis of the kernel of 
Pf- 

(a) If Z' is not given by finitely many reduced points, stop; this method 
fails. 

(b) Else continue with Z' = {[vi], . . . , [v s ]}. 

(4) Solve the linear system on the constants q defined by setting / = J2i=i °i v t- 
Output: The unique Waring decomposition of /. 
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Now we state sufficient conditions in the case n = 2 and d is odd for Algo- 
rithm [3] to succeed. (In the case d is even we don't state conditions because 
our experiments show that the Catalecticant Algorithm covers all cases that the 
Koszul algorithm can cover). 

Theorem 3.5. Suppose n = 2 and set d = 2m + 1. Let f = YH=i v t be a general 
form of rank r in S d V , let = [vi] G P(V) be the corresponding points and let 
Z = {zi, . . . ,z r }. Let Z' be the set of common eigenvectors (up to scalars) of 
ker Pf . 

(1) If 2r < m 2 + 3m + 4 then Z' = Z. Moreover Algorithm^ produces the 
unique Waring decomposition of f . 

(2) If 2r < m? + 4m + 2, then it is possible that Z C Z' . Even in this case, 
Algorithm^ will produce the unique minimal Waring decomposition. 

We postpone the proof of Thm. 13.51 until we have the tools from the next 
section. 

Before going on, we note that the map Pf factors, and therefore it will have 
smaller rank than what may be first expected. In order to accurately explain 
this, we will need to use representation theory and the language of partitions 
and Young diagrams according to [FH91j . 

The map Pf always has a non-trivial kernel, which comes from an analogy 
to the matrix case and the fact that every vector is an eigenvector of a scalar 
multiple of the identity. First we notice that Hom(S m V, f^V) splits as the direct 
sum of two SL(V)-modules, via the Pieri rule, (see [FH911 equation (6.9), p. 79]) 

(9) Eom(s m v, AV) = r m " ® f^v = r( m+1 ) a > m "~ a © T {mY-\{m-ir-^ ? 

where in general for any partition tt, T w is the SL(V^)-representation associated 
to it. In partitions, we use the notation m l to denote m repeated % times. Note 
that T n inherently depends on V, but we suppress this from the notation for 
simplicity. 

Lemma 3.6. Pf restricted to Y^" 1 ^ 1 '( m_1 ) n a+1 i s zero. 

Proof. By the linearity of Pf in f, it suffices to prove the lemma for f = v d . The 
essential fact that we use is that the representation T ma 1 '( m_1 ) n a+1 is isomorphic 
to a subspace of B.om(S m ~ l V, f^^V), which indeed splits as 

Hom(5 ,m_1 v, /\ a_ V) = r ma " 1 '( m- " 1 ) n_a+1 r (m " 1)a ' 2 ' (m " 2)n ~ a+2 . 

So consider M G r"i a - 1 ,(»»-i) a - 0+1 c Hom(S m V, ffV) . There is a natural 
equivariant map 

Hom(5 m ~ 1 V, A a " V) ->■ Hom(S m V, ffV) 
N ^ [v m Niv™- 1 ) A v] 

which is nonzero, so by Schur lemma it identifies the summands T ma >( m-1 ) n a+1 
in both sides. We write M as the copy of M in Hom(S' m_1 y, A"" 1 ^) according 
to this identification. 
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Then M(v m ) = MO™" 1 ) A v, so for this M, 

P vd {M){w) = (M(v m ) AvAw) (^ m " 1 ) = (M{v m - X ) A v A v A w) (v d ^ 1 ) = 0. 

□ 

3.1. The quotient bundle and eigenvectors of tensors. We show in this 
section that the eigenvectors of a tensor can be interpreted as zero loci of sections 
of twists of the quotient bundle. Similar methods were recently used in |OS10j 
to study matrices with eigenvectors in a given subspace. For generalities about 
vector bundles we refer to [QSS80] . FV is the projective space of lines in V, 
therefore H°(FV, 0(1)) = V*. The quotient bundle of FV which we will denote 
by Q, appears in the Euler exact sequence 

o-^ew(-i)^ew ® v— k), 

taking wedge powers and tensoring by 0(m) we get the sequence 
A a "V ® 0(m - 1)— V\V ® O(m)^ffQ(m)-^0. 
Taking the global sections we get 

(10) RomiS^V, A a "V)— ■» Eom(S m V, ffV)^H° (tfQ(m)) — K). 

For any tensor M G Hom(S m V, f^V) we denote by sm the section of / s \Q{m) 
corresponding to (j)(M) in sequence fllOl) . We want to show that the eigenvectors 
of the tensor M correspond to the zero locus of sm- To make this construction 
precise, we recall the following straightforward lemma. 

Lemma 3.7. 

(1) The fiber of f s \Q{m) at x = (v) is isomorphic to Hom((t> m ), /*\V/(v A 

A a -V». 

(2) The section sm vanishes in (v) if and only if v is an eigenvector of the 
tensor M . 

Proof. The section sm £ H°(/^Q{m)) corresponds on the fiber of v to the com- 
position (v m ) — l -^S m V — >f*\V-^-/^V/ (v A A" -1 ^) where % is the inclusion and 
7r is the quotient map. Now sm vanishes on (v) if and only if 7r(M(v m )) = if 
and only if M(v m ) A v = 0. □ 

Remark 3.8. Note that in the decomposition of formula B.om(S m V, V) = 
r m " ® V = r m+1 > mn ~ 1 QS™-^*, we have from Bott's theorem that H°{Q{m)) = 

pm+l,m™ _1 

Now we have the proper tools to anticipate the proof of Thm. 13.5} even if we 
need some results of Sections H] and 

Proof of Thm. \3.5[ First notice that by Thm. 12.31 we know that in this case we 
have a unique decomposition. 

Now we prove (1) by applying Prop. 14. 11 To match with the notation of 
Prop. 14. 1} take E = Q(m), a twist of the quotient bundle on P 2 , and L = 0(d) 
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with d = 2m + 1. (See also Example 14.21 for more details on the matrix version of 
this construction.) By Prop. |4~T| we have the inclusion H°{Q®Xz{m)) C ker Af. 
Note that E* ® L = Q*(-m + 2m + 1) = Q*(m + 1) = Q(m) = E. The 
equality holds if the map H°(Q(m)) — >H°(Q z (m)), is surjective. This is true 
by Thm. I5.1( i). 

It follows that rank Af = rankPj = 2r and we can apply Thm. 14.71 Moreover, 
in this case, Z' = Z. Indeed, we just showed that Z' is the base locus of 
H°(Q £g> Xz(m)), and the latter is equal to Z because of (ii) of Thm. 15.11 

For the proof of (2), now Z' is contained in the zero-locus Z" of a section 
of Q(m). It is enough to show that Z" imposes independent conditions on the 
hypersurfaces of degree 2m + 1. Indeed, from the sequence 

— >0 — >Q*(m + 1) — Yl z „(2m + 1) — >0, 

and the vanishing h\Q*(m + 1)) = 0, |OSS80] Ch.l § 1, reading Q*{m + 1) = 
Q l (m + 2), one shows that h 1 (Xz"{2m + 1)) = 0. Then the linear system in 
step (4) in Algorithm [3] has a unique solution (because the powers vf are linearly 
independent), which moreover only uses rank(P/)/2 of the points. □ 

3.2. Pentahedral example. Here we treat the Sylvester pentahedral example, 
in an analogous way to the Hilbert quintic case treated in the introduction. Let 
/ G S* 3 C 4 . The classical approach to the Pentahedral Theorem is to use the 10 
points pi which are the singular locus of the Hessian of /. These ten points are 
the vertices of the pentahedral formed by the five planes V{. We do not know 
how to express the five planes rationally from / with this approach. Enriques 
and Chisini, in the third book of their textbook |EC18j . attribute to Gordan the 
computation of the fifth degree covariant of / given by the five planes, in terms 
of symbolic calculus, but we found difficult to explicitly compute the Gordan 
covariant. 

Our approach covers both the proof of the theorem and the possibility to find 
explicitly the five planes. 

Theorem 3.9. Sylvester Pentahedral Theorem For any general f E S^C 4 , 
there exist unique Vi G C 4 (up to scalar) and q G C for i — 1, ... ,5 such that 

i=l 

The algorithm to find the is described in the proof. 

Proof. In the pentahedral case we have / G S 3 C A . Set a = 2, m = 1 in the Koszul 
flattening construction. This corresponds to constructing Pf : Hom(C 4 , /\ 2 C 4 ) — > 
Hom(C 4 , C 4 ). This is a 16 x 24 matrix coming from the 4x6 Koszul matrix 



k 2 = 



/ 


-x 1 


-x 2 





-X3 





\ 




x 





-x 2 





-x 3 










X 


Xi 










V 











•'■(-) 




x 2 ) 
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and substituting the 4x4 catalecticant matrix Cj. at each occurrence of Xj. 
Using Macaulay2 |GS10] it can be checked that the kernel of Pf has dimension 9 
and it is spanned by 9 vectors in C 24 ~ Hom(C 4 , f\ C 4 ) written as w^x" 1 A x J 
that can be grouped as {wqx, w Q2i w i2i w 03> w i3i ^23} with £ C 4 * linear forms. 

The general element M of the kernel can be computed, again with the help 
of Macaulay2, by a random linear combination of the nine elements of the basis 
and it has exactly five eigenvectors Vi (up to scale), in agreement with Prop. 16.21 
thus proving the existence and uniqueness statement of the theorem. Explicitly 
the five eigenvectors (which are dual to the planes of the pentahedral) can be 
computed using the 4x4 minors of 



/ x 2 


£3 








Wox\ 


-Xx 





x 3 





VJ02 


x 








£3 


WX2 





-x x 


-x 2 










x 





-x 2 


Wl3 


V 





•''0 


Xx 


w 2 J 



where the first 4 columns express k%. 

The zero locus of these minors is indeed formed by the five points corresponding 

to V{. □ 

Remark 3.10. The recent results in the nice paper [BCMT10] makes use of a 
different generalization of the classical methods for tensor decomposition. Their 
methods make use of a (semi-)Hankel operator constructed from all possible 
catalecticant matrices, and compute the linear forms in the decomposition of a 
tensor utilizing a method of zero- dimensional root finding involving simultaneous 
eigenvectors of companion matrices. In theory, this method eventually covers 
all cases to decompose a general symmetric tensor, however there is an essential 
difference with our methods. That is, their method relies on numerical techniques 
in order to construct the Hankel operator. As a consequence, for example in the 
pentahedral case, their technique gives the ideal of the 5 points numerically, 
while in our algorithm, we have found the ideal of the 5 points symbolically, and 
numerical techniques are used only to compute the individual points. 



4. New methods for tensor decomposition: bundle method 

4.1. The bundle construction. Let L be the line bundle on X which gives the 
embedding X C F(H°(X, L)*) = FW. In particular L = 0(d) on F n = FV gives 
the embedding of the Veronese variety X = Vd(JP n ) = Vd(FV), where in this case 
S d V = W, defined in the introduction. 

Let E be a vector bundle on X C FW. In [LOllj a linear map Af was con- 
structed, depending linearly on / £ W, which comes from the natural contraction 
map 

(11) H°(E)®H°(E* ®L)—+H°(L). 
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From (11 II) we get a linear map 



depending linearly on / G H°(L)*. 

Our starting point is the following result from |L011j . 

Proposition 4.1. |L011l Proposition 5.4.1] Let f = YH=i v i e W w ^ z i = 
[Vi] e X C WW and put Z = {z 1 , ...,z r }. 



The first inclusion is an equality if H°(E* (g) L) — >H°(E* £g> L\z) is surjective. 
The second inclusion is an equality if H°(E) — >H°(E\z) is surjective. 

Recall that the base locus of a space of sections of a bundle is the common 
zero locus of all the sections of the space. Again, let E be a vector bundle on 
X C F(H°(X,L)*). Let / = £ NeZ ^ G H°(X,L)*. Assume that H°(E* <g> 
L) — >H°(E* ® L| Z ) is surjective. Then the kernel of A/ : H°(E) -> <g> L)* 

is equal H°(Zz ® -E). So, if the base locus of H°(Zz ® E) is Z itself, then the 
decomposition of / can be computed from the base locus of kerA^. 

Some advantages of our algorithm are now apparent. First, kerAf can be 
computed by an explicit matrix construction that we give below and methods 
from linear algebra can be used to compute this kernel. Recall that there is 
always a brute force method where one guesses a rank r, chooses r linear forms 
Pi = Pifao, ■ ■ ■ ,x n ) each depending onn + 1 parameters, and tries to solve the 
system of polynomials given by comparing coefficients on the expression / = 



So if one compares our method to the brute force method, kerAf consists of 
polynomials of lower degree than the original polynomial /, and in general a sys- 
tem of polynomials of lower degree should be easier to solve than one consisting 
of polynomials of higher degree. We tested our algorithm using r randomly cho- 
sen linear forms and tried to decompose (the expanded form of) / = Yll=i If- 
The brute force method quickly fails (we run out of memory and time) even for 
the pentahedral example, where our algorithm succeeds in less than one second. 

4.2. A side remark on presentations. A presentation of a bundle is what 
allows us to make the transition between vector bundles and matrices. For a 
bit more on presentations we invite the reader to consult |Eis05l Chapter 6C] . A 
presentation of a bundle E on WV is constructed as follows. We have the (finite) 
minimal resolution of E, which is 



H°(E) ® H°(L)*—+H°(E* ® L)*, 



and this can be seen as a linear map 
(12) A f :H\E) 




H°(l z <g> E) C keiAj 
H°(X Z <8> E* (g) L) C (imA f ) 



(13) 
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where each Lj is a direct sum of line bundles, and has the property that the 
induced map H°(Li) — >H°(E) is surjective. This follows because the resolution 
is constructed as the sheafmcation of the corresponding resolution of graded 
modules. 

Moreover we have the (finite) minimal resolution of E* which is 

(14) ... — >L*_ X — >L* Q — >E* — >0, 

where each Li is again a direct sum of line bundles, which has the property that, 
even tensoring with a line bundle L, the induced map H°(Lq®L) — >H°(E* ®L) 
is surjective. Dualizing, we get a double resolution 

• • • — >L>2 — >Li~^-L — >L-i — > . . . , 

where im(p) = E. The map L±— ^-L gives the presentation of E. We get the 
composition 

(15) P f = f3oA f oa: H°(L 1 )-^H°(E)^Uh°(E* <g> L)*^H°(L* ® L)\ 

where a is surjective (because of (1131) . which is the sheafification of the minimal 
free resolution of the module @ m H°(E{m))) and (3 is injective (because of (TT4"j) ). 
It can be shown that the matrix of Pf can be constructed just by the presentation 
p with a block structure obtained as follows. If p has a linear entry depending 

on Xi, substitute the catalecticant matrix C a/ for each X{ (where the size of the 

ax i 

matrix is to be determined by L\ and Lq), and if p has non-linear entries, replace 
each monomial in the Xi by the catalecticant matrix of the associated derivative 
of /, for more details see |L011t Section 8.3]. 

Hence rank A f = rankP^ and, even more important kerAf and ker Pf have 
the same base locus. The advantage is that Pf can be explicitly computed 
from a matrix with entries homogeneous polynomials, and ker Pf is spanned by 
computable polynomials. 

Example 4.2. For a basic example, suppose d = 2m + 1, and let E = Q(m) be a 
twist of the quotient bundle on P 2 = FV, as in the case of Thm. 13.51 
This gives a presentation 

L x = 0{m) ® V^Oim + 1) <g> V* = Lq, 

which is part of the Koszul complex 

— >0{m - 1) — >0(m) <g) V^O(m + 1) ® V* — >0{m + 2) — >0 

After an appropriate choice of basis, p may be represented by the matrix 

X2 —X\ 

—x 2 x 
x\ — xo 

which is one of the Koszul matrices we have already seen. Then H°(Li) = 

s m v* ® V. 
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Now let L = 0(2m + 1). For any / G S 2m+1 V* = W, A f is the morphism 
from H°(Q(m)) to its dual #°(Q*(m + 1))* and 

P f : = Hom(5 m l/, V)-^ Hom(V, S m V) = tf ^ ® L)* 

is represented by the matrix 






run 


-c Ti 


_< ~72 









_L 7o 






where : S^V — >■ S' m y* are catalecticant matrices of the partial derivatives 
ft = §x~- From this presentation, we can see also that Pf is skew-symmetric. 
Note that when / = Xq 1 ^ 1 is the power of a linear form then the above matrix 
has rank 2, which indeed is the rank of E = Q(m). We further remark that 
then the principal Pfaffians of this matrix give equations for secant varieties of 
v 2m+1 (F 2 ), see jLOTT] . 

4.3. Vector bundles, statement and proofs of the main results. For this 
section we assume the following general setup. Let X be an algebraic variety and 
let L be the line bundle on X which gives the embedding X C ¥(H°(X, L)*) = 
FW. For / G W, let / = J2i=i v i be a minimal decomposition, let Zj = [v.-] G 
P(W) be the corresponding points and let Z = {z\, . . . , z r } For a vector bundle 
E over X, construct the map Af. H°(E) — > H°(E* <g) L)* as above. 

Proposition 4.3. Assume that rankAj = k ■ rankE. Then we have 

H°(T Z ®E) = kerAf 

H Q (X Z ®E*®L) = (im Af) 1 . 

Proof. Prop. 14.11 says that we always have the inclusion H°(Tz <8> E) C ker Af. 
But in fact we get equality because of the following dimension argument. 

codim(H°(Iz <S> E)) < k ■ rankE = rank(Aj) = codim(ker Af). 

The same argument applies to the second equality. □ 

We can give a general criterion 

Theorem 4.4. Assume that rank A f = k ■ rank-E and 

H°(X Z ®E)® H°{l z ® E* ® L) -> H°(2| <g> L) 

surjective. 

Assume that X is not k-weakly defective, then the common base locus of ker Af 
and of (im Af) 1 is given by Z itself, hence Z can be reconstructed by f. 



Note that the notion of /c-weakly defective has been introduced in |CC02[ 
Definition 1.2]. See also Thm. 12.31 and the paragraph thereafter. 
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Proof. We can use the equalities ofProp. l4~3l Assume that the common base locus 
of ker Af and of (im Aj) 1 contains Z U {z'}, then every element of H°(I Z 2 <g> L) 
vanishes doubly on z', that is the general hyperplane section (in the system 
H°(L)) of X which is singular at Z is singular also at z', and this contradicts 
[CC021 Theorem 1.4]. □ 

Example 4.5. The example of plane sextics vq(JP 2 ), where Z is given by 8 points 
is instructive. In this case, set E = 0(3) and L = 0(6), it is a classical fact 
that the map H°(l z <g> E) <g> H°(X Z ® E* ® L) -> H°(Z| ® L) is NOT surjective. 
Indeed, it is known from [LOllj that the catalecticant minors are not enough to 
give all the equations of cr 8 (t>6(IP 2 )), a Koszul flattening is needed in addition. In 
this case, the kernel of Af is spanned by a pencil of plane cubics and has base 
locus in 9 points, one more than the original eight. This is linked to the known 
classical fact that the pencil of plane cubics through 8 points has an additional 
ninth base point. Still, in this case, by (2) of Thm. I2.4[ Algorithm [2] succeeds to 
find the tensor decomposition. 

The theorem applies in the presentation setting as well. In the case E = 
KQ(5), there is the presentation p : L x = ftV®0{8) ->■ ff~ a V* ®0(5+l) = L 
such that imp = E. Then H°(Li) = B.om(S s V, f^V) and an element of H°(Lx) 
goes to a section of E vanishing in v if it corresponds to M G B.om(S s V, ffV) 
such that M(v s ) evA(^ 1 V), i.e. if v is an eigenvector of M. In this case, the 
base locus of the kernel can be studied directly in H°(Li). The criterion applies 
to specific tensors. 

We now come to the main practical result of this paper. 

Algorithm 4 (General algorithm to find tensor decomposition). 

Input: / G S d V where dim^/ = n + 1, E is a convenient vector bundle on 
f{V), to be chosen. 

(1) Construct the map Af as defined in ()12p . where L = 0(d). 

(2) Compute ker Af. If ker Af is trivial, stop, this method fails. 

(a) note that rank(/) > ■ 

(3) Find the base locus Z' of ker Af by explicitly computing ker Pf as in (fTBT) . 

(a) If Z 1 does not consist of a finite set of reduced points, stop; this 
method fails. 

(b) Otherwise continue with Z' = {[i>i], • • • , [v s ]}- 

(4) Solve the linear system defined by / = Yli=i c i v t m the unknowns q. 
Output: If there is a unique solution to (4), this is the unique Waring de- 
composition of /. Else we possibly find many minimal Waring decompositions 
off. 

Note that Thm. 14.41 applies to step (3) in that it says that the baselocus 
computed actually consists of the linear forms used in the construction of /. 

Remark 4.6. In practice one finds that the algorithm will fail if either ker Pf is 
trivial, in which case we must conclude that the rank of the input is too large 
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to be decomposed by this method (see also Rmk. 14.81 below), or the base locus 
of ker Pf contains infinitely many points, in which case we cannot determine the 
decomposition. In the latter case we conclude that there is a positive dimensional 
variety on which our input tensor lies. In some pathological cases it may happen 
that the base locus of ker Pf is non reduced, also in this case we cannot determine 
the decomposition. 

The following theorem has already been applied in many specific examples, 
(starting from the case rkE = 1 which corresponds to the catalecticant case), 
that show the versatility and power of the result. 

Theorem 4.7. Let f E S d V , and set Z' = baseloc(ker Af), and let length(Z') = 

s. Assume that rank(/) = -^jfpjjr ■ Then for any minimal decomposition f = 

J2l=i °i v i w tth z i = [ v i] G X , Q G C and Z = {zi, . . . , z r }, we have Z C Z' . 

If length(Z') < oo, then Algorithm^ produces all minimal Waring decompo- 
sitions of f , in particular if the solution is unique, we find the unique Waring 
decomposition of f . 

Proof. Indeed let / = Xl!=i Z 1 ^ be a decomposition with minimal rank and set 
Z = {z h . . . , z r }, with zi = [vi]. ByProp. Olwe have H°(l z <8> E) = ker Af. It 
follows that Z' is also the base locus of H°(Iz <8> E), hence Z' D Z. If Z' is finite, 
then we can try the linear system in step (4), and with finitely many attempts 
we find a decomposition. Uniqueness implies that the linear system given by 
/ = J2l=i °i v i has a unique solution, thus producing the unique minimal Waring 
decomposition. □ 

To further clarify, we restate the previous algorithm in a bit more detail in the 
case E = /\ a Q(m), and in the presentation setting: 

Algorithm 5 (Koszul Flattening Algorithm (General)). 
Input: / G S d V, where V has basis {x , . . . , x n }. 

(1) Compute 5- = and 5 + = , and choose a = |~|] , the Koszul 
flattening to use. 

(2) Construct the Koszul matrices k p for p = n + I — a,n + 2 — a 

(3) Construct the catalecticants Cf t : S s+ V — > S S ~V, of /$ = for each i. 

(4) Construct the matrix Pf. B.om(S s + V, ffV) -> Hom( ff~ a V, S 5 - V) by 
substituting Cf i for X{ in the matrix fc„ + i_ a . 

(5) Compute a basis {Mi, . . . , M t } of ker Pf, and associate vectors of poly- 
nomials vli to each M,. If ker Pf is trivial, stop; this method fails. 

(6) Compute the eigenvectors {v±, . . . ,v s } of a general element in ker Pf as 
follows: 

(a) For each Wi compute the (™^) x (™^) minors of the block matrix 
( k n+2 -a | w ) and store these minors in an ideal J. 

(b) Find the set {v%, . . . ,v s } of common eigenvectors of all M in ker Pf 
by computing the zero-set of J. If s is infinite, stop, the method 
fails. Otherwise continue. 
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(7) Solve the linear system / = Yli=i c i v f ^ n the unknowns q. 
Output: The unique Waring decomposition of /. 

Remark 4.8. Notice that in the middle of our algorithm (step (5)), the algorithm 
will fail if a certain matrix we construct has a trivial kernel. This is an indication 
that the rank of the tensor is higher than the ranks which we can successfully 
decompose. We give precise bounds on the rank of the tensor (depending on the 
number of variables) for the success of this algorithm in Section [5j So the failure 
of the algorithm provides a lower bound for the rank of the input tensor. 

4.4. Quintic and Pentahedral examples revisited. Suppose d = 2m + 1. In 
the quintic example we set m = 2 and E = Q(2) on P 2 . The general section of 
Q{2) vanishes on 7 points. 

In the pentahedral example we set m = 1 and E = Q*(2) on P 3 . The general 
section of Q*{2) vanishes on 5 points. 

An interesting remark is that asking that h°(Q(m)) > nc n (Q(m)) (necessary 
condition to get that the zero locus of a section of Q(m) is given by c n (Q(m))Q 
arbitrary points, by counting parameters), one checks that the only solution for 
m > 2 to the diophantine equation resulting fromProp. l6.ll is given by n = m = 2 
(the quintic example). 

In the same way, asking that h°(Q*(m)) > nc n (Q*(m)), the only solution to the 
diophantine equation resulting fromProp. 16.21 for m > 2 is given by n = 3, m = 2 
(pentahedral example). In particular, our methods find all of the already known 
cases where the decomposition of a general symmetric tensor is unique, but do 
not provide any more uniqueness than was already known classically. This gives 
evidence that there may not be any other cases where uniqueness holds for the 
general tensor. 

5. Rank bounds for feasibility of our algorithms 

In this section we give bounds, depending on the number of variables and the 
rank of the tensor, for when our algorithm will succeed to produce the Waring 
decomposition of a given symmetric tensor. 

Below in Thm.[5J]and Thm. l5\3l we study bundles (over P 2 and P 3 respectively) 
twisted by an arbitrary integer m. More generally we consider F n and apply these 
results to prove Thm. 13.51 and Thm. 15.41 in the specific case that d = 2m + 1. 

Theorem 5.1. Let Q be the quotient bundle on P 2 and let Z be a finite collection 
of s general points in P 2 . 

(i) Ifs< |(m + 3)(m+ 1) and m > 0, then h°(Q®X z {m)) = (m + 3)(m + 
1) - 2s; 

this is equivalent to the fact that H°(Q(m)) — >H°(Qz{m)) is surjec- 
tive. 



! We recall a few facts about the Cliern class c„ in Section [6] 
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(ii) If s < \{yn 2 + 3m + 4) (m > 2) (or s < 3 for m = 1), then the base locus 
of H°(Q ®X z {m)) is given by Z itself. 

Proof. First note that, since Z is a general collection of points, it is reduced. 
By semi- continuity, both statements can be proved for a special collection Z. 
We prove the theorem by induction from m — 2 to m, so we have to distinguish 
the even and odd cases. The starting cases m = and m = 1 can be easily 
checked directly. If m — 1 then (i) says: if s < 4, then ®X^(1)) = 8 — 2s; 
Indeed an element A £ corresponds to a traceless endomorphism of 

V and by Lemma 13.71 it vanishes exactly at the eigenvectors of A. The space 
E(vi, . . . v s ) of traceless endomorphisms which have s general vectors V\, . . . v s as 
eigenvectors has dimension 8 — 2s for s < 4, which proves the initial case m = 1 
of (i). Moreover the common eigenvectors of the endomorphisms in E(v±, . . . v s ) 
for s < 3 are Vi, . . .v s themselves, which proves the initial case m — 1 of (ii). The 
initial case m = of (i) is easier, while the initial case m = 2 of (ii) corresponds 
to the plane quintic example developed in the introduction. In this case Z is 
given by seven points, H°(Q ®X Z {2)) is one dimensional and it is spanned by 
the unique section of Q(2) vanishing on Z, here it plays an essential role that 
C2(Q(2)) = 7. This example may be checked also in Macaulay2. 

First we prove (i). Let C ~ P 1 be a smooth conic. We recall that we have the 
splitting Q(m)\c = C|pi(2m + 1) © 0\pi(2m + 1). Indeed, from all the possible 
splittings 0\pi(a) © 0|pi(/3) with a + (3 = 4m + 2 the most balanced one is the 
only compatible with the vanishing H°(Q(—l)\c) = which follows from the 
exact sequence 

o— >>o(-3)— >>o. 



We choose to specialize s = min(s, 2m + 2) points as general points on C, and 
we call Z' C P 1 the subcollection obtained. Let Z" be given by the remaining 
points. Note that \{m 2 + 4m + 3) - \{m - 2) 2 + 4(m - 2) + 3 = 2m + 2, so 
that the length of Z" satisfies the assumption of the inductive step. There is a 
natural exact diagram 











lz» -2) 
0{-2) 







O 
Oz 






Xz',c 
O c 
O z > 
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Tensoring the first row by Q(m) we get the exact sequence 
(16) — >Q(m - 2) ®T z ,i — >Q(m) ®X Z — VL z ,\v (2m + 1) ©I z1 ,i (2m + 1) — >0, 
and the associated cohomology sequence 

0^H°(Q(m - 2)®l z „)^H (Q(m)®l z )^C 4m+4 - 2S . 

Hence 

h°(Q(m) <g> X z ) < h°(Q(m - 2) ® X z ») + ten + 4 - 25, 
and by the inductive assumption we get the inequality 

h°(Q®X z (m)) < m 2 + 4m + 3 - 2s. 
By considering the sequence 

— vQ®X z (m) — >Q(m) — MC 2 <g> C z — K), 

we get the opposite inequality, then the result. 

The proof of (ii) follows the same inductive step and we specialize the points 
in a similar manner. We choose to specialize now only min(s, 2m + 1) points 
as general points on a smooth conic C ~ P 1 . An anonymous referee pointed 
out that the case m = 3, s = 11 cannot be proved by induction because the 
case m = 1 our statement holds only for s < 3. Indeed the case m = 3, 
s = 11 has to be proved separately (by direct computation). Our computation 
in Macaulay2 that accomplishes this can be checked and repeated using the file 
"General Kappa method. m2" (see Section [7]) and setting the parameters s = 11, 
d — 7, n = 3. 

For any z' ^ Z we have to prove that there is a section a G H°(Q <g) X z (m)) 
such that cr(z') ^ 0. Note that by the inductive assumption there is a 1 G H°(Q® 
Xz"(m — 2)) such that cr'(z') ^ 0. Let / be the equation of C. If z' ^ C then 
a = cr'f works. The proof of (i) (specifically the sequence ([IB]) specialized to 
fewer points) shows that 

H°(Q ® ZsH)— Kff°(Q ® Z^,o(m)) 

is surjective. If z' G C there is a section <t" g H°(Q ® Iz',c(m)) which does 
not vanish in 2' (here we need that length Z' < 2m + 1) and there is a section 
in H°(Q eg) X z (m)) which restricts to a" and then does not vanish in z', as we 
wanted. □ 

Remark 5.2. Note that when m is even (m > 2) and 2s = m 2 + 4m + 2 then 
H° (Q ®X z {m)) is generated by one section, and the number of points in the base 
locus is given by c 2 (Q(m)) = 1 + m + m 2 > |(m 2 + 4m + 2) and the equality 
holds only for m = 2. The case m = 2 is indeed special, and it is the case of 
S" 5 C 3 addressed in the introduction. The corresponding tensor decomposition of 
S 5 C 3 (considered here in the introduction) was noted in |L011j . 

The following theorem is a generalization of Thm. 15.11 from P 2 to P 3 . In 
principle it is possible to obtain similar theoretical bounds for f^Q on P n , but it 
does not seem easy to find sharp bounds like those in Thm. l5.1l on P 2 . See the 
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Rmk. 15. 5 1 which shows that this topic can be tricky. We will continue to use Q for 
the quotient bundle on any P 3 (and later we will use the same Q for the quotient 
bundle on P n ), and when we restrict this quotient bundle to a smaller space we 
will indicate that as Q\^2 for example - we hope the usage will be clear in context. 
Recall, from the Euler sequence, that h°(F 3 ,Q(m)) = |(m + 4)(m + 2)(m + 1) 
and /r°(P 3 , Q*(m + 1)) = \{m + 4)(m + 3)(m + 1) for m > 0. 

In the proof of Thm. 13.51 there was extra symmetry (that the bundles E and 
E* ® L happened to be isomorphic in the case E = Q(m) and L = 0(2m +1)). 
Without this symmetry, we will need two analogous pairs of statements (stated 
next) in order to apply Prop. 14.11 for the proof of Thm. 15.41 

Theorem 5.3. Let Q be the quotient bundle on P 3 and let Z be a finite collection 
of s general points in P 3 . 

(i) Ifs < H(h (Q(m)) - m + 2) J = [± (|(m + 4)(m + 2)(m + 1) - m + 2) J 
and m>0, then h°(Q ®X z (m)) = Urn + 4)(m + 2)(m + 1) - 3s. This 



>H (Qz(m)) is surjective. 



is equivalent to the fact that H°(Q(m))- 

(ii) Ifs<l + 4)(m + 2)(m + 1) - ^ - ?f + 5) (m > 2) (or s < 4 

for m — 1 ), then the base locus of H°(Q (gil^(m)) is given by Z itself. 
(111) Ifs < [\{h°{Q*{m + 1)) - m + 1)J = [\ (|(m + 4)(m + 3)(m + 1) - m - 
and m > 0, then h°(Q* <g> l z (m + 1)) = \{m + 4)(m + 3)(m + 1) - 3s. 

1)) zs 



T/izs is equivalent to the fact that H°(Q*(m + 1)) — >H°(Q* 2 
surjective. 

(w) Ifs<l (±(m + 4)(m + 3)(m+l) - ^ - f - 8) fm > 2j ^or s < 5 /or 
m = 1), then the base locus of H°(Q* ®Xz{m + 1)) zs ^wen by Z itself. 

Proof. By semi-continuity, each statement can be proved for a special collection 
Z . We prove the theorem by induction from m — 1 to m. The starting cases 
m = 1 can be easily checked directly. Note that Q{m)\ ¥ 2 = Q P 2(m) © ¥ 2{m). 
We choose to specialize s = min (s, [|(/i°((3(m)|p2)J) points as general points on 
a hyperplane P 2 and we call Z' C P 2 the subcollection obtained. Let Z" be given 
by the remaining points. Let g(m) and f(m) respectively denote the numbers 



(17) 
and 



g[m) 



f{m) 



■(m + 4)(m + 2)(m 



m 



It is straightforward to check that f{m) — f(m — 1) 
of Z" satisfies the assumption of the inductive step. 
Then we have the exact sequence 



g(m), so that the length 



— >Q(m — 1) ®X Z " — >Q{m) ®X Z — >X z >\p 2 ®Q|p2(m) — >0, 
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3s. 



and the associated cohomology sequence 

0^H°(Q(m - 1) <g> l z „)^H\Q(m) ® I z )— >C h ° WM i' a) - 3a . 

Hence 

/i°(Q(m) ®X Z ) < /i°(Q(m - 1) ®Xz») + /i°(Q(m)| p2 ) - 3s, 

and by the inductive assumption we get the inequality 

h°{Q®X z (m)) < (m + 4)(m + 2)(m+ 1) 

By considering the sequence 

— >Q <g> X z (m) — ►Q(m) — ^C 3 <g> O z - 

we get the opposite inequality, then the result. 

The proof of (ii) follows the same inductive step and we specialize the points in 
a similar manner. We choose to specialize now only min(s, [| ((m 2 + 3m + 4) + ( 
points as general points on P 2 . Now let g'(m) and f'(m) respectively denote the 
numbers 



►0, 



m+ 



(19) 
and 
(20) 



g\m) 



f\m) = - 



m 



3m + 4) 



m + 2 
2 



1 , , . . . . m 2 3m 
-( m + 4)(m + 2)(m + l)-- 



Also in this case it is possible to check that f'(m) — f'(m — 1) — g'{m). 

For any z' £ Z we have to prove that there is a section s G H°(Q ®X z {m)) 
such that s(z') ^ 0. Note that there is s' G H°(Q (g) Z Z "(m — 1)) such that 
s'(^') 7^ 0. Let /i be the equation of P 2 . If z' £ C then s = s'h works. The proof 
of (i) shows that 

H°(Q ® l z (m))^H°(Q ® X z ,, p2 (m)) 

is surjective. If 2' G P 2 there is a section s" G H°(Q ®X Z i^2{m)) which does not 
vanish in z' (here we need (ii) of Thm. l5.ll) and there is a section in H°(Q®T z {m)) 
which restricts to s" and then does not vanish in z' , as we wanted. The proof of 
(hi) is very similar to the proof of (i) because Q*(m + 1) restricts on every plane 
to Q P 2(m) © C p2 (m + 1) . We set 



(21) 

and 
(22) 



f(m) 



9{m) 



1 (\ 



- -(m + 4)(m + 3)(m + 1) 



3 V2 



m 



We check that f(m) — f(m — 1 
Then we proceed exactly like in (i). 



g{m) and the initial case holds: /(0) = 1. 
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The proof of (iv) is very similar to the proof of (ii). So let 

/ \ ,/ v l/ /9 . /m + 3 

(23) g\m) = - I (m 2 + 3m + 4) + f g 

and let f(m) = [§ + 4)(m + 3)(m + 1) - ^ - f - 8^J. Then one checks 

that f'(m) — f'(m — 1) = g'(m) and the initial case holds: f'(2) = 11. 

□ 

The following theorem generalizes Thm. 13.51 to the case n > 3. Again, when 
d is even, use Algorithm [2J In the following theorem we set E = A n ~ 1 Q(m) = 
Q*(m + 1). Note that the pentahedral example of 14.41 corresponds to n = 3 
and m = 1. So we have the maps Af\ H°(Q*(m + 1)) — > H°(Q(m))* and 
P/ : Hom(S m V, A™ -1 ^) ->■ Hom(V, S m F). 

Theorem 5.4. Suppose n > 3 and se£ d = 2m + 1. Lei / = X^=i w f ^ e a / orm 
m S^V o/ ranA; r such that f is general among the forms in S d V of rank r, let 
z% = [vi] G P(V) be the corresponding points and let Z = {zi, . . . , z r }. 

(1) Let Z' be the set of common eigenvectors (up to scalars) ofkerPf. If n 
is even and r < ( m ^ n ) , then Z' agrees with Z. Moreover Algorithm^ 
produces the unique Waring decomposition of f. 

(2) If n is odd and r < ( m ^ n ) let Z' be the set of common eigenvectors (up 

to scalars) o/kerPj and (im Pf) . Then Z' = Z and Algorithm^ with 
this modification, produces the unique Waring decomposition of f . 

(3) Ifn = 3 andr < | (§(m + 4)(m + 3)(m + 1) - *f - f - 8) let Z' be the 

set of common eigenvectors (up to scalars) ofkeiPf. Then Algorithm^ 
(with a = 2) produces the unique Waring decomposition of f . 



Proof. First notice that by Thm. 12.31 we know that in these cases we have a 
unique decomposition. 

Now we prove (1). By |L011l §7] and [LOlll Theorem 1.2.3] we have that the 
natural map 

(24) H\/S(Q ®lz(m)) ® H\K~ a Q ®lz(m))^H°(l z2 (2m+ I)) 

is surjective. Moreover, the dimension of the image of the map in (124)) is equal to 
the rank of the normal bundle of the variety cut out by the minors of size ( n )r+l 
of Pf (where / is considered here as a polynomial with variable entries). By the 
Alexander-Hirschowitz Theorem [272] the space if°(X^2(2m + 1)) has the expected 
codimension r(n + 1) in the space H (O(2m + 1)), hence the rank of the normal 
bundle is the expected one (™ +2 ™ +1 ) —r(n + 1) and it follows that the dimension 
of the scheme cut out by the minors is the expected one r(n + 1) — 1 at [/]. In 
particular this scheme has a reduced irreducible component containing [/] which 
is the r-secant variety to the d- Veronese embedding of P n . It follows that the 
scheme cut out by minors is smooth of the expected dimension at /, hence we 
have the equality rk(Pj-) = ( n )r = rk(/\ a Q) -rk(/), otherwise, if rk(Pj) is smaller, 
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then the variety cut out by minors should have been singular at [/] . Note that in 
[LOU} Theorem 1.2.3] it was fixed the value a = [n/2\, but we may apply it as 
well in the case a = \n/2] because we get just the transpose map. If n is even, 
then the symmetry of Pf guarantees that ker Pf = (imPj-)" 1 . By Thm. I4.4[ with 
E = /^Q and L = 0(2m + 1), we get that Z = Z' (the assumptions of Thm. 14.41 
are satisfied by Thm. 12. 3j) . The proof of (2) is analogous. 

The proof of (3) follows the same lines of the proof of Thm. 13.51 but we use 
Thm. EH] at the place of Thm. EU □ 

Remark 5.5. An interesting case is n = 4, d = 3, r = 7, a defective case addressed 
in [Qtt09j . Set V = C 5 , pick a general / = £)T =1 vf with Z = {[v x ], . . . , [v 7 ]} and 
construct Pf. Hom(V, /\V) -> Hom(A V,V). The locus Z' of common eigen- 
vectors of ker Pf is the unique rational quartic curve in FV passing through Z. 
Note that a general element in Hom(V, /\ V) has no eigenvectors, in agreement 
with Thm. 13.41 In more geometric terms, this means that all sections of f*\Q(l) 
vanishing on Z also vanish on Z'. This follows easily by the construction and by 
Terracini Lemma. So, as a byproduct of Algorithm [5j we have found an algo- 
rithm to write down the unique rational quartic curve through 7 general points 
in P 4 . According to Ranestad-Schreyer (that we quote) the uniqueness is by 
Castelenuovo, (see the proof of Prop. 5.2 in their paper). 

6. Using Chern classes to count the number of eigenvectors of a 

general tensor 

To a vector bundle E on an algebraic variety X are associated its Chern classes 
Ci(E) G A l (X), where A(X) is the Chow ring of X. The reader unfamiliar with 
Chern classes may wish to consult [Har77j or |OSS80j . For vector bundles on 
PV, we have <A l (PV) = Z and the Chern classes can be considered as integers. 

The basic principle that we will use is the following. If a vector bundle E of 
rank r on a variety X has a section vanishing on Z, and the codimension of Z is 
equal to r, then the class of \Z\ G A r {X) is computed by [Z] = c r (E). 

The following proposition is a particular case of a more general result, proved 
recently by Cartwright and Sturmfels with toric techniques |CSllaj . who proved 
a conjecture stated in |NQWW07] , whereProp. 16. II was proved in the case m odd. 

Proposition 6.1. (Cartwright- Sturmfels) The number of eigenvectors (counted 
with multiplicity) of a general M G Hom(S m V, V) is equal to 

m n+\ _ x 

m — 1 

Proof. By Lemma 13771 the number of eigenvectors of a general M G Hom(S' m V, V) 
is equal to c n (Q(m)). In order to compute this number we use the formula (see 
|OSS80] section 1.2) 

c n (E ® L) = J2 ^ ~ l ) c t (E)ci(L) n - 1 
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for a vector bundle E and a line bundle L. 
In our case we have 

1 — m n+l 



c. 



,(Q(m)) = ^2 c i {Q)c 1 {0{m)) n ~ i = ^Tm"" 



1 — m 

i=0 i=0 



where we have used the well known fact that Ci(Q) = 1, which follows immedi- 
ately from the exact sequence 

— >0(-l) — >0®V — >Q — >0. □ 

Proposition 6.2. The number of eigenvectors (counted with multiplicity) of a 
general M £ B.om(S m V, f^~ 1 V) is equal to 

(m + l) n+1 + (-l) n 
m + 2 

Proof. By Lemma the number of eigenvectors of a general M £ Hom(S' m V / , A"" 1 
is equal to c n (/\ n_1 Q(^)) = c n (Q*(m + 1)), which can be computed in a sim- 
ilar way by noting that = (—1)*. To follow the previous proof, we set 
m' = m + 1 . 



c n (Q*(m')) = ^Q(Q*) Cl ((9(m')) n " 1 = J>r 



i=0 j=0 



= y-(- my -' = (-ir 1 - <- m '>" +1 = < m ')" +1 + □ 

^—^ 1 + m' m' + 1 

7. Macaulay 2 Implementation 

The tensor decomposition algorithms in this article could be implemented in a 
variety of computational algebra or computational linear algebra packages. We 
chose to implement our algorithms in Macaulay 2 because we found many of 
the procedures we would need were already implemented. Our algorithms may 
be easily adapted to other languages, depending on the desired features. Our 
code may be found online with the ancillary materials accompanying the arXiv 
version of our paper or by contacting either author. In this section we have tried 
to use the verbatim text style to indicate Macaulay2 input. 

7.1. The catalecticant algorithm implementation. Our first example is an 
implementation of the catalecticant algorithm (Algorithm |2]). For this exam- 
ple we work with degree d polynomials on n + 1 variables. This is the file 
"cat_method.m2" . Our experiments show that working over a prime charac- 
teristic base field, or also the rational numbers, for relatively small d and n, 
the catalecticant method quickly computes the decomposition within the range 
of Thm. 12.4} an d sometimes when the degree is low, even succeeds for slightly 
higher ranks than predicted by the bound. For this implementation the user can 
change the values of degree d and projective dimension n as well as the rank s 
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of the test polynomial at the beginning of the file. The ring R can be either over 
the rationals or over a prime characteristic field. 

A polynomial f f over ground field KK = ZZ/p or QQ is constructed as the sum 
of s d th powers by the following. 

R = KK [x_0 . . x_n] 

f f =sum(s , i-> (random (1 ,R) ) ~d) 

Next, we construct a map that computes the "most square" catalecticant ma- 
trix associated to a given input polynomial. We can do this conveniently by 
using ceiling and floor commands to define the degrees, the basis command to 
define vectors of appropriate sizes for what would be the labels of the rows and 
columns of the matrix and the dif f command to construct the matrix. 

af = floor(d/2) 
ac = ceiling(d/2) 
xaf = basis (af,R) 
xac = basis(ac,R) 

catalecticant=f ->dif f (transpose xac , dif f (xaf , f ) ) 

The next step is to find the base locus of the kernel of the catalecticant matrix. 
First we compute the generators of the kernel, then we convert these integer vec- 
tors to an ideal of polynomials using the basis of the base space, and finally we 
decompose the radical of the ideal by first computing the saturation (this some- 
times results in a speed-up, and is justified because the saturation has the same 
scheme-theoretic structure, and this is all that concerns us with this application). 

K =gens kernel catalecticant ff 

I = ideal (xaf *K) 

L = decompose saturate I 

The list L contains the apolars of the linear forms which will (up to scale) be 
used to write the decomposition. 

Next we construct a ring S by appending constants c_i whose number is the 
number of points in the base locus of the kernel. 

S = KK[x_0. .x_n,c_0. .c_ (length L -1)] 

We construct the polynomial f c = Y2 V e L c i v h where the v* are the apolar forms 
to the Vi. We accomplish the swap between a form its apolar form within the 
summation as follows. 

bS = sub(basis(l,R) ,S) 
fc = sum (length L,i-> 

c_i* ( (bS* (mingens kernel diff(bS, transpose mingens sub(L_i,S))))_(0,0))~d ); 

Next we solve the linear system on the c_i obtained by setting equal to zero 
all d th derivatives of the expression f c-ff . 

Ic= ideal (sub (dif f (basis(d,R) ,ff ) ,S) - diff (sub(basis(d,R) ,S) ,f c)) ; 
Vc = decompose saturate Ic 
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Finally we substitute the found values for the c_i into the polynomial f c check 
to see if the decomposition succeeded. 

FF =sub (substitute (f c,S/Vc_0) ,R) 
FF-ff 

Our tests succeeded to find decompositions for the following n, d, and s for 
example. 

— n=2: (d=3, s=l) , (d=4, s<=4) , (d=5, s<=4) , (d=6, s<=8) 

— n=3: (d=3, s=l) , (d=4, s<=7) , (d=5, s<=7) , (d=6, s<=16) 

— n=4: (d=3, s=l) , (d=4, s<=10) , (d=5, s<=10) , (d=6, s<=16) . 

7.2. The Koszul flattening algorithm implementation. Our second exam- 
ple is the implementation of the Koszul flattening algorithm (Algorithm |5]). This 
is contained in the file "General Kappa Method. m2" . 

As before we tested our algorithm by taking a sum of a fixed number s of powers 
of random linear forms, expanding the resulting polynomial, and then testing to 
see if our algorithm gave the correct decomposition. Here we will describe the 
aspects of this algorithm that differ from the catalecticant algorithm. 

As before, we construct a map that computes the "most square" catalecticant 
matrix associated to a given input polynomial of degree d-1. The degree drops 
because we will eventually feed this map the first partial derivatives of our input 
polynomial f f . Then we construct the Koszul complex. 

M = ideal (basis (1,R)) 
RM = resolution M 

Using the dif f command again, we take a matrix from the Koszul complex 
and construct a block matrix, replacing each entry in the Koszul matrix with 
the catalecticant of the partial derivative of our test function with respect to 
the entry in the Koszul matrix. This matrix is our Koszul flattening, where 
ka indicates which map in the Koszul complex we are using. In this case 
ka=n+l-ceiling(n/2) . The matrix K corresponds to the map called Pf in this 
paper. 

K = dif f (transpose RM.dd_ka, catalecticant ff) 

The base locus of the kernel of the Koszul Flattening K is a set of (generalized) 
eigenvectors. Later we will construct this base locus, for now we compute the 
generators of the kernel of the Koszul Flattening. 

KM = generators kernel K 

The kernel of the Koszul flattening should be a vector space of polynomials, 
but at present it is expressed as integer vectors. The Koszul flattening K is an ax a 
blocked matrix (in the case n even, with small variations in the odd case, when 
the matrix is no longer square) of mxm blocks where a = binomial (n+1 , ka) and 
m=binomial(n+d-ac-l,n). The kernel of K respects this block structure. So we 
convert each blocked integer vector in the kernel into a smaller vector where 
each of the a blocks becomes a polynomial written in the basis of monomials 
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previously defined at xaf . Macaulay 2 can do these computations simultaneously 
on matrices and not just individual vectors. 

for i from to a-1 do { G_i =submatrix(KM, {i*m. . (i+l)*m-l},) ;} 
pG =xaf* G_0; for i from 1 to a-1 do { pG = pG I |(xaf*G_i);} 

The outcome pG is a matrix, each row of which is a vector of polynomials in the 
kernel of K. 

The kernel of K is a subspace of Hom(S , ^ ca V, V), and the zero-set of the a x a 
minors of the matrix representing a map in are generalized eigen- 

vectors. Therefore, for each basis vector of the kernel of K, we construct an ideal 
of a x a minors. Since we are interested in the common generalized eigenvectors 
to all basis vectors of the kernel of K, we construct an ideal J which is generated 
by all of the a' x a' minors we constructed as follows: a'= binomial (n,ka) + l. 

J= ideal (0*x_0) ; 

for i from to r-1 do 

J= J + minorsCa' ,RM.dd_(ka+l) | transpose (submatrix(pG , ,{i}))) ; 

Next we want to compute the zero-set of the the ideal J above. In order to 
save time, we first compute the saturation of the ideal since an ideal and its 
saturation have the same zero-set (scheme). 

L = decompose saturate J 

The list L consists of linear forms which are generalized eigenvectors in the kernel 
of K. The solutions in L are the polar forms to those that we want. The rest of 
the implementation is identical to that of the catalecticant algorithm implemen- 
tation. 

This example code succeeds for the following initial parameters: 

— n=2, (d=3, s<=3) , (d=4, s<=3) , (d=5, s<=7) , (d=6, s<=7) , 
— n=3, (d=3, s<=5}), (d=4, s<=5) , (d=5, s<=ll),(d=6, s<=ll) , 
— n=4, (d=3, s<=6), (d=4, s<=6) , (d=5, s<=14) . 

We were able to test that over prime characteristic, all of these cases are sharp 
except for the case n=4, d=5, as in this case the algorithm slowed considerably 
as s grew. Note for example that when n=3, d=5, and s=8 there are rationality 
problems. 

Remark 7.1 (Remark on numerical methods for inexact solutions and practical 
issues) . Many of the bounds in the examples we have presented could be improved 
if we allowed for irrational or complex solutions to our systems of polynomials. In 
addition, we could succeed in treating the generic case if we were to use numerical 
eigenvector methods on the Koszul flattening and to proceed by declaring the 
kernel of K to consist of those eigenvectors of K which are associated to small 
eigenvalues. We believe that the algorithms we have presented are well suited to 
such adaptations, however we leave this to future study. 
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